Effect of Global Visual Features on Scene Segmentation for Videos
Authors
Abstract
Automatic video segmentation is the necessary first step in organizing a long video file into smaller units for subsequent browsing and retrieval. The smallest basic unit is the shot, a contiguous sequence of frames recorded from a single camera operation. Related shots are then grouped into a higher-level unit called a scene, which conveys some meaning to viewers. In this paper, we first give a stricter scene definition. We then investigate the performance of two recent scene segmentation techniques, both of which use visual features of entire video frames to group shots into scenes. Our experimental results on two full-length films reveal an interesting insight, suggesting a better approach for scene segmentation.
Similar resources
Traffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
Compressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
Compressed-Sampling-Based Image Saliency Detection in the Wavelet Domain
When watching natural scenes, an overwhelming amount of information is delivered to the Human Visual System (HVS). The optic nerve is estimated to receive around 10^8 bits of information a second. This large amount of information can’t be processed right away through our neural system. Visual attention mechanism enables HVS to spend neural resources efficiently, only on the selected parts of the...
Automated Video Segmentation for Lecture Videos: A Linguistics-Based Approach
Video, a rich information source, is commonly used for capturing and sharing knowledge in learning systems. However, the unstructured and linear features of video introduce difficulties for end users in accessing the knowledge captured in videos. To extract the knowledge structures hidden in a lengthy, multi-topic lecture video and thus make it easily accessible, we need to first segment the vi...
Text Driven Temporal Segmentation of Cricket Videos
In this paper we address the problem of temporal segmentation of videos. We present a multi-modal approach where clues from different information sources are merged to perform the segmentation. Specifically, we segment videos based on textual descriptions or commentaries of the action in the video. Such a parallel information is available for cricket videos, a class of videos where visual featu...